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0 .  Summary 

The  following  researches  are  done  under  the  Grant  No.  AFOSR- 

86-0048  during  the  year  1986-87. 

(1)  Two  new  measures  are  proposed  to  identify  influential  sets 
of  observations  at  the  design  stage  in  view  of  prediction 
and  fitting.  A  relationship  is  established  between  one  of 
proposed  measures  and  the  Cook's  measure  at  the  inference 
stage . 

(2)  The  problem  of  measuring  dispersion  effects  at  different 
levels  of  factors  in  factorial  experiment  is  very  important 
in  quality  control  studies.  Assuming  that  for  the  fitted 
model  to  the  data  "here  is  no  significant  lack  of  fit,  we 
proposed  three  measures  of  dispersion  effects  at  levels  0 
and  1  of  factors  in  a  2m  factorial  experiment.  All  three  of 
them  are  relevant  in  replicated  factorial  experiments  and 
two  of  them  are  applicable  to  unreplicated  factorial 
experiments.  We  observe  that  the  measures  of  dispersion 
effects,  based  on  residuals  obtained  by  the  least  squares 
fit  of  the  model  to  the  data,  at  levels  0  and  1  of  a  factor 
are  correlated  in  most,  situations.  We  introduce  a  method  of 
adjusting  residuals  and  then  propose  measures  based  on 
residuals  and  adjusted  residuals. 

(3)  This  research  is  in  progress.  Ve  character ixe  designs  which 
enable  us  to  measure  and  compare  dispersion  effects  of 
level.-,  isv-'is.  We  obi.  a  i  n  Some  important  results  in  this 


area . 
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1 .  Research  Done 

1.1  Two  Methods  of  Identifying  Influential  Sets  of  Observations 


The  assessment  of  influence  of  a  set  of  observations  in  the 
analysis  of  data  is  important  not  only  at  the  inference  stage  but  also 
at  the  planning  (or  design)  stage.  A  set  of  observations  under  a 
design  is  said  to  be  influential  if  the  set  affects  not  only  the 
fitting  of  the  model  to  the  data  but  also  the  prediction  in  terms  of 
the 'fitted  model.  In  the  problem  of  identifying  sets  of  t  (a  positive 
integer)  Influential  observations,  we  assume  the  underlying  design  is 
robust  against  the  unavailability  of  any  t  observations  [Ghosh 
(1979)].  To  explain  this  concept  we  consider  the  standard  linear 
model 

E(^)  =  XJ3,  (1) 

V(X)  =  o2I,  (2) 

Rank  X  =  p,  (3) 

where  (Nxl)  is  a  vector  of  observations,  X(Nxp)  is  a  known  matrix, 

2 

JKpxl)  is  a  vector  of  fixed  unknown  parameters  and  o  is  a  constant 
which  may  or  may  not  be  known.  Let  d  be  the  underlying  design 
corresponding  to  The  design  d  is  assumed  to  be  robust  against  the 

unavailability  of  any  t  observations  in  the  sense  that  the  parameters 
in  jl  are  still  unbiasedly  estimable  when  any  t  observations  in  are 
unavailable.  There  are  I  ^ }  possible  sets  of  t  observations.  The  idea 
of  iobustness  of  designs  against  unavailability  of  data  is  fundamental 
in  measuring  the  influence  of  a  set  of  observations.  We  measure  the 
influence  of  a  set  of  t  observations  by  assuming  the  observations  in 
the  set  unavailable  and  then  assessing  the  model  fitted  with  the 
remaining  (N-t)  observations. 


V~V  v;-vv>?w >>>>>: 


»  V  V-  U  V  ' 
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Flrst  Method 

We  propose  the  first  measure  in  terms  of  precise  prediction  of  t 
unavailable  observations.  A  set  of  t  observations  is  the  most  in¬ 
fluential  if  the  model  fitted  with  the  remaining  (N-t)  observations 
doe&  the  worst  job  in  predicting  i_  unavailable  observations.  We 
denote  the  ith  set  of  t  observations  in  by  and  c^e  remaining 

observations  in  by  ^  ^  ;  the  corresponding  submatrices  of  X  by  X^^ 
and  X^^;  the  resulting  design  when  t  observations  in  the  ith  set  are 

unavailable  by  d^\  i  =  l,...,(^).  The  least  squares  estimators  of  J3 

(i)  “  -1 

under  d  and  d  are  3^  =  (X'X)  X'^_  and  ^(i)  = 

(xj^  xj^)  *x|*^  ^  ^  ‘  We  wr*te  the  values  of  £  under  d  and 


,(i) 


as 


^  "  XAi  a,ld  -!^U)  =  XAj(1) 


xc 


.  When  t  observations  in  the  ith 


are  unavailable,  the  predicted  values  of  unavailable  observations 
^2^  from  available  observations  are  the  elements  in  =  X^^B^i). 

The  reliability  of  these  estimators  can  be  judged  by  = 

-  a2 

(1) 


The  first  measure  of  influence  of 


*2 


is  def ined  as 


=  Trace  V  (4) 

Hie  smallest  value  of  I  i  =  l,...,l^),  for  i=u,  Indicates  that 

the  nth  set  of  t  observations  Is  the  least  Influential  in  terms  of 
precise  prediction  of  unavailable  observations.  On  the  otherhand  the 
largest  value  of  1  ^  f  ^  ^ .  l~l  , . . . ,  ) ,  for  i^w,  indicates  the  wth  set 

of  t  observations  is  the  most  influential. 

We  denote  the  1th  observation  in  by  y  and  the  ith  row  In  X  by 


j  ,  i  1  , .  .  .  ,  N . 


-4- 


Theorem  1  For  any  design 
.- 2 .  (  (1) 


-  ijtr- )  >  i 


x  .  ( X '  X )  1 X , 
—i  —I 


u|t, . itl  1  -  M«'xr‘ii  ’ 


(5) 


(  4 


•  ) 


where  the  i^,...,it  rows  of  X  are  rows  of 
Theorem  2  If  for  a  design,  the  individual  observations  are  equally 
influential  then 


? 


no 


Uv.)  =  7i,  r  . 

t  i  (N-p) 


(6) 


Theorem  3  If  for  a  design,  the  individual  observations  are  equally 
influential,  then 

n 

(  i  \  .  nd'  t 

(7) 


T  [  ..  (  1  )  »  s,  P*’  t 
1^-2  j  -  (N-p) 


From  (6)  and  (7),  we  observe  that  for  a  design  with  equally 
influential  individual  observations  >.  t 


Second  Method 

The  vector  of  least  squares  fitted  values  of  N  observations  is 
uncorrelated  with  a  complete  set  of  orthonormal  linear  function  of  £ 
with  zero  expectations.  This  fact  implies  the  optimum  property 
"Minimum  Variance"  of  the  vector  of  the  least  squares  fitted  value  as 
an  unbiased  estimator  of  its  expectation.  Assuming  a  set  of  t 
observations  unavailable,  the  least  squares  fitted  values  of  the 
remaining  (N-t)  observations  are  correlated  with  a  complete  set  of 
orthonormal  linear  functions  of  ^(Nxl)  with  zero  expectation  and 
therefore  the  covariance  matrix  is  not  a  null  matrix.  Further  the 
covariance  matrix  is  away  from  the  null  matrix,  the  more  influence  has 
the  set  of  t  observations  on  the  least  squares  fitted  values  of  the 


remaining  (N-t)  observations. 


Let  Z  ((N-p)xN)  be  a  matrix  such  that  Rank  Z  =  (N-p),  ZX  =  0  and 
ZZ'  -  I.  It  can  be  seen  that  Cov(^,ZjO  =  0.  This  implies  that 


has  the  minimum  variance  within  the  class  of  all  unbiased  estimators 


of  E(^)  under  (1-3).  When  t  observations  are  unavailable,  the  least 

(1)  _  v(i> 


squares  fitted  values  are  ^ ' 


We  denote  the  sub- 


■i  Aid) 


matrices  of  Z  corresponding  to  X  j  ^  and  X^^  by  zj^  and  Z^^. 


follows  that  CovC^^.Z^)  =  a2  [x(  1 }  ( xj 1 } '  xj1 )  )  ‘xj^’zj^  ].  The 


(i) 


further  Cov(^  ,Zy)  is  away  from  the  null  matrix,  the  more 

(i) 


influential  Is  the  set  of  t  observations 


We  thus  have  the 


second  measure  of  influence  as 


*2^2^^  =  °2  f-Sum  Squares  of  elements  in  Cov  1  ^ , Zy_)  \ .  (8) 


(i) 


The  largest  value  of  I  2(4°),  for  i=w,  indicates  that  the  wtli  set 


of 


t  observations  is  the  most  influential.  The  following  results  show 

(i) 


some  similarities  between  two  measures  of  influence  )  an^ 


(i) 


h'4  )• 


/  N  , 

Theorem  4  The  following  is  true  for  i=l,...,l  ), 


vfz^V^M  =  vfz^0]. 


(9) 


Theorem  5  The  following  is  true. 


IJy<i))  =  Trace  vfz^--^ 


2' In  1  =  trace  vi ^  '  I.  (10) 

The  equations  (4),  (8)  and  (10)  display  the  similarity  between  two 

measures  of  influence  I  Although  the  matrix  Z  is 

(i) 


not  unique,  it  can  be  checked  that  ^  2^ %-2  ^  un^9ue  f°r  choices 


of  the  matrix  Z. 


IMl 
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Relatlonship  with  Cook's  distance 

Cook  (1977)  proposed  a  distance  function  between  _y^  and  _^(i): 
popular  as  Cook's  distance,  at  the  inference  stage  as 


•d(i) 


-*rW^(i)-V 


(ii) 


p  s 


where  (N-p)S^  =  (^“Xj ) '  (.ZT^ ) •  The  Cook's  distance  measures 
the  degree  of  influence  of  t  observations  in  the  ith  set  on  the 
estimation  of  The  following  result  shows  that  the  first  measure  of 

influence  is  in  fact  related  to  D^,  . 

Theorem  6  From  i4)  and  (11),  we  have 

E(p  dJ  =  IIfj^i)).  (12) 

Examples  are  presented  in  Ghosh  (1987)  to  illustrate  applicability  of 
the  two  proposed  methods. 
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1 . 2  Comparing  Dispersion  Effects  At  Various  Levels  Of  Factors  In  Factorial 

Experiments . 


We  consider  a  2  factorial  experiment  under  a  completely  randomized 
design.  Let  T(nxm)  be  the  design.  Thu  rows  of  T  denote  treatments  and 
the  columns  denote  factors.  The  design  T  is  called  an  inner  array  for  m 
controlled  factors.  For  various  level  combinations  of  noise  factors 
(outer  array),  we  get  replicated  observations  for  every  treatment  in  T 
(see  Taguchi  and  Wu  1985).  In  the  experiment,  we  take  r  (>  1)  observa¬ 
tions  for  every  treatment.  The  case  r  =  1  is  called  the  unreplicated  ex¬ 
periment  and  the  case  r  >  1  is  called  the  replicated  experiment.  Again, 
for  simplicity  equal  replication  is  considered  for  the  replicated  experi¬ 
ment  and  the  Idea  Is  easily  extendable  to  unequal  replications.  Let  y 


ij 


be  the  j th  observation  for  the  ith  treatment,  y^  be  the  mean  of  all 


observations  for  the  treatment  i,  i=l,...,n  and"j=l , . . . r,  and  (N  =  nr)  be 
the  total  number  of  observations.  The  standard  linear  model  for  the 
experiment  is 

e(^)  =  xe,  (i) 

V(£)  «  a2l,  (2) 

Rank  X  =  p,  (J) 

where  ^(N*l)  Is  the  vector  of  observations,  j?(p*l)  Is  t he  vector  of 


factorial  effects  considered  In  the  experiment,  X(Nxp)  is  a  known  matrix 

2 


that  depends  on  the  design  T  and  o  ts  an  unknown  constant.  We  denote 
H  =  X(X'X)  ^X'  and  P.  =  (!->!).  The  vectors  Hy_  and  are  t he  vector  of 
least  squares  fitted  values  and  the  vector  of  residuals,  respectively. 

The  fitted  values  for  all  observations  corresponding  to  the  Ith  treatment 


are  Identical  and  is  denoted  by 


,  t-l,...,n.  Suppose  that  for  the 


fitted  model  to  the  data  there  is  no  significant  lack  of  fit.  The  sum  of 

n  r 

squares  of  error  is  SSE  =  T.  T. 

i=l  J=1 

MSE  =  (SSE/(N-p)),  the  sum  of  squares  of  pure  error  is  SSPE  = 
n  r 
Z  Z 
1=1  j=l 

( SSPE/n ( r- 1 ) ) .  Note  that  both  MfE  and  MSPE  are  measures  of  error  vari- 
2 

ance  o  .  We  now  take  USE  and  MSPE  as  descriptive  measures  of  noise.  We 

then  express  MSPE  as  the  weighted  average  of  (MSPE)^  and  (MSPE)^,  where 

(MSPE)  is  called  the  contribution  of  the  level  u  (u  =  0,1)  of  the 

u 

factor  to  MSPE.  We  do  the  same  for  MSE.  Different  levels  of  a  factor  n.iv 
contribute  differently  to  MSE  and  MSPE.  In  general  t he  contributions  of 
levels  of  a  factor  to  noise  (measured  by  MSPE  or  MSE)  are  called  the 
dispersion  effects  of  levels  of  the  factor. 

We  take  a  single  factor  out  of  m  factors  and  develop  the  methods  of 
measuring  dispersion  effects  at  the  level  0  and  1  of  the  chosen  factor. 

We  do  not  introduce  any  notation  for  the  chosen  factor.  This  is  to  keep 
our  presentation  neat  and  clean.  We  define  for  1=1,..., n, 

[  1  If  the  level  of  the  factor  in  the  ith  treatment  is  l, 

6  =  J 

0  If  the  level  of  the  factor  in  the  ith  treatment  is  0. 

Let  (N*N)  be  a  diagonal  matrix  with  n  sets  of  diagonal  elements  and 
the  elements  in  the  ith  (1=1, ...,n)  set  are  equal  to  5^.  We  define 
0^  =  I-Dj  .  It  can  be  seen  that  1'^  D  =  0  and  both  1)^  and  D  are 
idempotent  matrices.  We  have 


(y  -y^j  and  the  mean  square  of  pure  error  is  MSPE  = 


(>’  -y^)^,  the  mean  square  of  error  is 


x.  R  Dj,  R  X  +  x'  R  D  R  x 


E  £  Sl^yn'>,l|2  +  E  '  (l-«.)(y  -y  )2 

1-1  j-i  ‘  *J  1  1.,  j./  l”yU  V  ’ 


ssre ' JM  6‘ly‘r?i)2  ♦,!, 

Hie  first  set  of  measures  of  dispersion  effects  of  levels  of  the  factor 


E  T‘  My.  «->”« )2 


S 2  ( 1 )  =  — M.L 

1  n 


iw  ij  '  i 


(  £  O  (r-1) 
i  =  l 


S2  ( 1 ) 
(T 


r  S  f  1-6  )l  v  -y  ]■ 
in;li  '  i  ’ 

M  j-1 _ _ _ _ _ 

n 

(  l  {  1-6.  ))  (r-1) 
i  =  l 


at  the  levels  l  and  0,  respectively. 


We  have 


— —  h  'u  i)  +  \  — 


s:.(  i ) 


MS  PE 
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Thus  S2(l)  and  S2 ( 1 )  are  regarded  as  (MSPE)^  and  (MSPE)^  in  the  notation 

2  2 

of  the  previous  section.  If  S^(l)  >  Sy(l),  we  then  say  that  the  level  0 
of  the  factor  has  less  contribution  to  MSPE  and  therefore  would  be 
preferred  to  the  level  l  in  view  of  stability  against  noise  factors. 

We  denote  Rank  R  R  =  V ^  and  Rank  R  L>^  R  =  .  We  now  present  the 

the  second  set  of  measures  of  dispersion  effects  of  levels  of  the  factor 
as 

Sj(2)  =  Cv'  R  D1  R  1)/V[  , 

Sy  (2 )  =  U'  R  Do  R  _y)  /  Vy  .  (5) 

at  the  levels  1  and  0,  respectively.  We  have 


MSF. 


V 

(N-p)J 


si(2)  +  f(N-7)K(2) 


2  2 

We  now  investigate  the  situation  where  S'  fl)  =  S  (2),  u  =  0,1.  In 

u  u 

ot' er  words,  we  like  to  characterize  designs  for  which  y.  =  y^.  We 
denote  the  row  of  the  matrix  X  corresponding  to  the  treatment  i  by 
x^(l*p).  Note  that  for  eacli  i,  l=l,...,n,  the  xow  _x|  is  repeated  r  times 
In  X.  Let  X*(nxp)  be  a  matrix  whose  ith  row  is  _x| .  Notice  that  rows  of 
X*  are  infact  distinct  rows  of  X.  We  have  X’X  =  r(X*'X*). 


Theorem  1.  For  i=l,...,n,  y^ 


y  if  and  only  if  X*(X*'X*)  1  X*' 


I  . 
n 


Corollary.  For  n  =  p,  we  have  S2 ( 1 )  =  S2(2),  u  =  0,1. 

- -  u  u 

We  thus  observe  that  for  designs  with  n  =  p,  two  sets  of  measures  are 
Identical.  The  class  of  designs  with  n  =  p  includes  the  known  Plackett 
and  ilurman  designs  (see  Plackett  and  liurman  196  7 )  .  We  however  strongly 


-11- 


feel  that  this  class  of  designs  is  very  weak  in  view  of  measuring 
dispersion  effects,  particularly  for  the  condition  that  there  is  no 
significant  lack  of  fit. 

We  denote 


where  ^  is  the  vector  of  ali  observations  corresponding  to  treatments 
with  6^  =  u,  u  =  0,1  for  the  chosen  factor.  Two  vectors  of  residuals 
rj  X  anc*  ry  X  aC  the  '  and  0  of  the  factor  are  generally  cor¬ 

related  under  the  model  (1-3).  We  now  present  a  vector  of  "adjusted 
residuals"  at  the  level  0  of  the  factor,  adjusted  w.r.t.  r^  x  so  that 
it  is  uncorrelated  with  r^  x*  We  denote 


H  = 


11 


12 


,  R1 1  = 


R1 11  R1  12 

R  1  1 2  R  1  1  3 


.  R10 


'101 


102 


where  R^ i \  (VjxVi)  with  its  rank  Vj ,  r^  (VjxnJ  with  its  rank  Vj.  In 
fact  we  have  Riu  =  r  j,  |  r  1 1  •  We  now  write 

(6) 


0a 


r  —  R  '  R  ^  r 

0  101  111  11 


It  can  be  seen  that  Rank  r  =  [ ( N — p ) — V  ]  =  V  (say)  and  furthermore, 

0a  1  Oa 

Cov(r^  x»  rQa  y)  =  We  call  r^  x  the  vector  of  "adjusted  residuals" 
at  the  level  0  of  the  factor,  adjusted  w.r.t.  the  residuals  at  the  level 
l  of  the  factor.  We  denote 


Oa 


Oa  1 
Oa  2 


where  r,.  ,  ( V.  *n)  with  its  rank  V,,  .  We  now  have  the  sum  of  squares  of 
Oal  v  Oa  1  Oa 

the  sets  of  linear  functions  r^  and  r^^  [see  Scheffe  1959]  as 

SS(rn  L)  =  X’ru  [tu'hi^n  L  . 

SS(r0al^)  =l'r'Qal  !r0airual  ]_1r0al  L  >  (7) 

with  d.f.  Vj  and  V  ,  respectively.  We  present  the  measures  of 
dispersion  and  adjusted  dispersion  effects  of  levels  of  the  factor 

s2((3)  -  (ss(rn  x)/vi  J  , 

Sfte<3)  ■  [SS<rOal  y.)/V0,I  '  <8> 

at  the  levels  L  and  0  (adjusted  for  level  1),  respectively.  We  have 


(N-p)  /  Sl(3) 


UPF,  S0a<’>- 


Following  the  above  approach  we  find  the  vector  r  of  adjusted 

l  a 

residuals  at  the  level  1  of  the  factor  adjusted  w.r.t.  r^  _y_  so  that  it  is 

uncorrelated  witli  r„  y.  Let  r.  ,  f  V ,  xn)  be  a  submatrix  of  r  such  that 

Rank  r.  ,  =  Rank  r,  =  V,  ,  r,.,(v,  xn)  be  a  submatrix  of  r  with  Rank  rn . 
lal  la  laOlO  0  01 

~  Rank  Iq  =  .  We  again  present  the  measures  of  dispersion  and  adjusted 

dispersion  effects  of  levels  of  the  factor 

sL(3)  ■  lss<r,al  r>/vJ  • 

S2<3)  »  (ss(r01  i)/vo]  ,  (10) 

at  the  levels  1  (adjusted  for  t he  level  0)  and  0,  respectively.  We  have 


MSB  = 


(N-p)  )  Sla(3) 


7 N~-pT /  S0(3)  ’  (N-«’)  “  Vla  +  V0  =  V1  +  V(U) 


fheorem  2.  The  following  results  are  true. 


i.  V,  >  (  I  6,  )  (.  r  -  1  )  ,  V,  >  (  T.  (1  —  6  )  )  (r-1 )  , 


ii.  v  (3)  >  (  y.  6  J(r-l)S*(l), 
ia  la  —  1  1 

V0aS0a(3)  ^ 

11  -  , 
ill.  If  Vla  =  (  Z  6 1 J  ( I  - 1 )  then  Sja(3)  =  S‘(l)  , 

i  =  l 

Iv.  If  V(Ja  =  (  Z  (1-6  )j(r-l)  then  S*  (3)  =  sj(l). 
i  =1  '* 

We  now  study  the  measures  in  two  extreme  situations:  (1)  rj  X  anc* 
r  y  are  uncorrelated,  i.e.,  R  =  0,  (ii)  r  y  and  r  y  are  completely 
correlated,  i.e.,  r(^  =  D  r^  for  some  matrix  I). 

Theorem  3.  Consider  the  situation  R,„  =  0.  Then  S2 ( 3 )  =  S 2 C 2 )  =  S?  (3), 
- — -  10  u  u  ua 

u=0 , 1 . 

Theorem  A.  If  r.,.  =  D  r..  then  we  have  r„  =  U,  V  =  0  and  SS(r  N  ,y) 

— - —  01  11  0a  0a  Oal"^ 

=  0. 

Theorem  3  tells  that  in  case  Rio  =  0  there  Is  no  need  for  the  adjustment 

of  residuals.  Theorem  A  tells  that  in  case  r^  £is  linearly  dependent  on 

dependent  on  r^  ^  then  the  level  1  of  the  factor  makes  all  contribution 

to  SSE  and  the  level  0  does  not  make  any  additional  contribution  to  SSE. 

In  case  V  =  V,  =  0,  we  have  V/x  =  V,  =  (N-p),  r  ,  =  D  r,  ,  and  D  is 
0a  la  0  1  01  11 

nonsingular.  This  is  a  situation  where  the  levels  0  and  1  have  equal 

dispersion  effects  because  of  the  design  influence.  It  follows  from 

Theorem  2  that  for  r  >  1,  and  V  are  both  nonzero.  (We  assume 

naturally  that  there  Is  at  least  one  <5^  =  1  and  at  least  one  (l-<5^)=  1.) 

For  tile  case  r  =  1,  at  least  one  of  V  and  V  could  be  zero  or  both  of 

0a  1  a 


them  could  be  nonzero.  We  now  consider  the  important  situation  when 


both  V Qa  and  V ^  are  nonzero.  If  S^(3)  >  Maximum  (Sq(3),  S^a C 3 ) ) ,  then 

the  level  0  of  the  factor  has  an  advantage  edge  over  the  level  1.  On  the 

other  hand  if  S^(3)  <  Minimum(  ( 3 ) ,  S^(3)),  the  level  I  of  the  factor 

is  superior  to  the  level  0  in  terms  of  smaller  dispersion  effects. 

We  now  state  some  properties  of  the  descriptive  measures  proposed  in 

Section  3  under  the  model  (1-3).  We  first  observe  that  the  measures 
2  2 

Sj(l)  and  Sq(1)  do  not  depend  on  the  fitted  model  and  all  other  measures 

2  2 

depend  on  the  fitted  model.  The  measures  S^(l)  and  Sq(1)  are  always 

2  2 

uncorrelated  under  the  model  (1-3).  The  measures  S  ^ ( 2 )  and  Sq(2)  may 
however  be  correlated.  Two  sets  of  linear  functions  of  observations  Dj  R 
and  D0  R  £  are  uncorrelated  if  and  only  if  D j  R  Dq  =  0.  Therefore  if  Dj 

9  2 

R  Dq  =  0  then  S^(2)  and  Sq(2)  are  uncorrelated.  We  have  the 
following  results: 

2 

Theorem  5.  Suppose  %  N(x_S,o  I).  A  necessary  and  sufficient  condition 
that 

i'  R  Di  R  X  ? 

(1)  — - - - n.  Central  Xz  with  d.f.  =  Trace  Rjj, 

a 

t  R  D0  R  z  2 

(2)  ■ - - -  %  Central  X  with  d.f.  =  Trace  Roo» 

o/ 

(3)  and  furthermore,  (1)  and  (2)  are  statistically  independent, 
is  that  Rio  =  0 

Notice  that  Di  R  Dq  =  0  if  and  only  if  Rio  =  <>.  Moreover,  Vi  +  Vo  = 

(N-p)  if  Rio  =  0  and  Vi  +  V0  could  be  greater  than  (N-p)  if  Rio  *  0.  We 

y  2 

question  the  use  of  estimators  S'j(2)  and  S^(2)  for  comparison  unless 
R 1 0  =  0.  We  of  course  realize  that  the  condition  Rio  =  0  is  too 
stringent  to  satisfy  even  for  one  out  of  m  factors. 
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Theorem  6.  The  following  results  are  true. 

a*  1  1  6i(yi1-y1)  =  1  1  ( i-fi{)(yn-yJ  =  o, 

i=i  j=i  J  i=i  j=i  1  J  1 

b.  If  for  the  factor  Rio  =  0,  then 

b.l.  Rn  and  Rgg  are  idempotent  matrices, 

b * 2 *  =  Ruu  V  u  "  °*1* 

b . 3 .  X'  R  =  0,  u  =  0,1, 
u  uu 

b-4  1  1  6iyi(yii-yJ  =  1  1  ( 1_6<  )y.(yn-yt)  =  o. 

i=l  j=l  J  i  =  l  j  =  l  1  1  1 

2  2 

The  measures  S^(3)  and  S^_u^g(3),  u  =  0,1,  are  always  uncorrelated. 
The  reason  for  adjusting  residuals  is  to  obtain  uncorrelated  dispersion 
effects. 
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In  the  problem  of  comparison  of  dispersion  effects  at  levels 
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